Application of Bayesian networks to two classification problems in bioinformatics
نویسنده
چکیده
The application of machine learning techniques to bioinformatics problems has become increasingly popular in recent years. Of particular interest are probabilistic graphical models since they provide a concise representation for inferring models from data. Current applications include the learning of gene regulatory networks (Friedman, 2004) and protein function prediction. Bayesian networks are the subset of probabilistic graphical models that can be expressed as directed acyclic graphs (Jordan, 1996; Jensen, 2001). They use a combination of domain knowledge and data to provide a framework that can be used to model the relationships between sets of variables in a probabilistic manner. They are a particularly powerful tool in bioinformatics since they can handle incomplete data sets allowing the building of a model from a training set with (different) missing data, or facilitating the prediction of a variable’s state/value based on a restricted set of evidence. For example, an important variable may be unknown during testing, yet marginal probabilities can still be calculated. Causal relationships between variables can also be learnt giving us information on the contribution of each variable to the prediction, and any independencies between variables can be exploited to reduce the complexity of the model. In this work, we use Bayesian networks for two classification tasks in bioinformatics. Our results show an improvement over previous machine learning methods applied to these problems. This improvement may be due to the Bayes nets’ ability to capture the interactions between multiple variables. Preliminary results show that the framework of probabilistic graphical models provides a good basis for modelling bioinformatics data.
منابع مشابه
A Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملA Surface Water Evaporation Estimation Model Using Bayesian Belief Networks with an Application to the Persian Gulf
Evaporation phenomena is a effective climate component on water resources management and has special importance in agriculture. In this paper, Bayesian belief networks (BBNs) as a non-linear modeling technique provide an evaporation estimation method under uncertainty. As a case study, we estimated the surface water evaporation of the Persian Gulf and worked with a dataset of observations ...
متن کاملA Probabilistic Model for COPD Diagnosis and Phenotyping Using Bayesian Networks
Introduction: This research was meant to provide a model for COPD diagnosis and to classify the cases into phenotypes; General COPD, Chronic bronchitis, Emphysema, and the Asthmatic COPD using a Bayesian Network (BN). Methods: The model was constructed through developing the Bayesian Network structure and instantiating the parameters for each of the variables. In order to validate the achiev...
متن کاملA Bayesian Networks Approach to Reliability Analysis of a Launch Vehicle Liquid Propellant Engine
This paper presents an extension of Bayesian networks (BN) applied to reliability analysis of an open gas generator cycle Liquid propellant engine (OGLE) of launch vehicles. There are several methods for system reliability analysis such as RBD, FTA, FMEA, Markov Chains, and etc. But for complex systems such as LV, they are not all efficiently applicable due to failure dependencies between compo...
متن کاملApplication of Artificial Neural Networks in a Two-step Classification for Acute Lymphocytic Leukemia Diagnosis by Blood Lamella Images
Introduction: This study aimed to present a system based on intelligent models that can enhance the accuracy of diagnostic systems for acute leukemia. The three parts including preprocessing, feature extraction, and classification network are considered as associated series of actions. Therefore, any dysfunction or poor accuracy in each part might lead in general dysfunction of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005